Dataset statistics
| Number of variables | 12 |
|---|---|
| Number of observations | 7669950 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 702.2 MiB |
| Average record size in memory | 96.0 B |
Variable types
| NUM | 6 |
|---|---|
| CAT | 6 |
user_id has a high cardinality: 228554 distinct values | High cardinality |
date_from has a high cardinality: 7205513 distinct values | High cardinality |
date_until has a high cardinality: 7194377 distinct values | High cardinality |
start_station_name has a high cardinality: 208 distinct values | High cardinality |
end_station_name has a high cardinality: 208 distinct values | High cardinality |
booked_via has a high cardinality: 223 distinct values | High cardinality |
date_from is uniformly distributed | Uniform |
date_until is uniformly distributed | Uniform |
df_index has unique values | Unique |
distance_in_km has 293637 (3.8%) zeros | Zeros |
Reproduction
| Analysis started | 2021-03-13 12:19:10.383402 |
|---|---|
| Analysis finished | 2021-03-13 12:26:04.082556 |
| Duration | 6 minutes and 53.7 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
| Distinct | 7669950 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8056972.421 |
|---|---|
| Minimum | 0 |
| Maximum | 16228295 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 58.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 643750.45 |
| Q1 | 3943278.25 |
| median | 8024147.5 |
| Q3 | 12252320.5 |
| 95-th percentile | 15458150.55 |
| Maximum | 16228295 |
| Range | 16228295 |
| Interquartile range (IQR) | 8309042.25 |
Descriptive statistics
| Standard deviation | 4774769.999 |
|---|---|
| Coefficient of variation (CV) | 0.5926258338 |
| Kurtosis | -1.220860839 |
| Mean | 8056972.421 |
| Median Absolute Deviation (MAD) | 4152774 |
| Skewness | 0.01473336482 |
| Sum | 6.179657562e+13 |
| Variance | 2.279842854e+13 |
| Monotocity | Strictly increasing |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 13930435 | 1 | < 0.1% | |
| 14004199 | 1 | < 0.1% | |
| 9807846 | 1 | < 0.1% | |
| 5619685 | 1 | < 0.1% | |
| 13996003 | 1 | < 0.1% | |
| 9725918 | 1 | < 0.1% | |
| 5537757 | 1 | < 0.1% | |
| 13905879 | 1 | < 0.1% | |
| 9701330 | 1 | < 0.1% | |
| Other values (7669940) | 7669940 | > 99.9% |
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 1 | 1 | < 0.1% | |
| 2 | 1 | < 0.1% | |
| 4 | 1 | < 0.1% | |
| 5 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 16228295 | 1 | < 0.1% | |
| 16228293 | 1 | < 0.1% | |
| 16228291 | 1 | < 0.1% | |
| 16228290 | 1 | < 0.1% | |
| 16228289 | 1 | < 0.1% |
bike_id
Real number (ℝ≥0)
| Distinct | 2681 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 121288.1916 |
|---|---|
| Minimum | 106022 |
| Maximum | 143866 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 58.5 MiB |
Quantile statistics
| Minimum | 106022 |
|---|---|
| 5-th percentile | 108385 |
| Q1 | 116498 |
| median | 119919 |
| Q3 | 120512 |
| 95-th percentile | 143722 |
| Maximum | 143866 |
| Range | 37844 |
| Interquartile range (IQR) | 4014 |
Descriptive statistics
| Standard deviation | 10746.43914 |
|---|---|
| Coefficient of variation (CV) | 0.08860251765 |
| Kurtosis | 0.4183371185 |
| Mean | 121288.1916 |
| Median Absolute Deviation (MAD) | 893 |
| Skewness | 1.144872394 |
| Sum | 9.302743654e+11 |
| Variance | 115485954.2 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 143619 | 4872 | 0.1% | |
| 116251 | 4868 | 0.1% | |
| 119685 | 4800 | 0.1% | |
| 108483 | 4792 | 0.1% | |
| 119882 | 4744 | 0.1% | |
| 143694 | 4727 | 0.1% | |
| 120000 | 4723 | 0.1% | |
| 119612 | 4721 | 0.1% | |
| 107509 | 4704 | 0.1% | |
| 107353 | 4700 | 0.1% | |
| Other values (2671) | 7622299 | 99.4% |
| Value | Count | Frequency (%) | |
| 106022 | 71 | < 0.1% | |
| 106025 | 1296 | < 0.1% | |
| 106033 | 185 | < 0.1% | |
| 106035 | 628 | < 0.1% | |
| 106040 | 851 | < 0.1% |
| Value | Count | Frequency (%) | |
| 143866 | 1697 | < 0.1% | |
| 143855 | 363 | < 0.1% | |
| 143833 | 1796 | < 0.1% | |
| 143832 | 2477 | < 0.1% | |
| 143831 | 4379 | 0.1% |
| Distinct | 228554 |
|---|---|
| Distinct (%) | 3.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 58.5 MiB |
| 496D35CFE3F625730E578793269E52D0A45FE53E | 2714 |
|---|---|
| 6DF3E96544415EBFF474F968F264E144772F508E | 2583 |
| 7439201395BB2E80301974D4D00100F1F8A7AFB4 | 2299 |
| 5EBBA60A2178EF837A2A2065E05B1A84C9B4FD94 | 2258 |
| F4B0220EB708EB3C7D2966B0194FA19640B458C5 | 2167 |
| Other values (228549) |
| Value | Count | Frequency (%) | |
| 496D35CFE3F625730E578793269E52D0A45FE53E | 2714 | < 0.1% | |
| 6DF3E96544415EBFF474F968F264E144772F508E | 2583 | < 0.1% | |
| 7439201395BB2E80301974D4D00100F1F8A7AFB4 | 2299 | < 0.1% | |
| 5EBBA60A2178EF837A2A2065E05B1A84C9B4FD94 | 2258 | < 0.1% | |
| F4B0220EB708EB3C7D2966B0194FA19640B458C5 | 2167 | < 0.1% | |
| B55462DA30B9D64E617B92DF0A99AC509BCC461B | 2100 | < 0.1% | |
| 19C08F00C4101E327BF935F49D228C5398AA9F06 | 2001 | < 0.1% | |
| 63D3262EA34B00E18F9A801AE1832C618FD70D49 | 1946 | < 0.1% | |
| D56E514389AF41CEE25EB57352A9CAC5D7371006 | 1944 | < 0.1% | |
| BDBE0F11FE2C06152C2D97FF4B02E02D1D962C6E | 1938 | < 0.1% | |
| Other values (228544) | 7648000 | 99.7% |
Frequencies of value counts
Unique
| Unique | 26999 ? |
|---|---|
| Unique (%) | 0.4% |
Histogram of lengths of the category
Length
| Max length | 40 |
|---|---|
| Median length | 40 |
| Mean length | 40 |
| Min length | 40 |
| Distinct | 7205513 |
|---|---|
| Distinct (%) | 93.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 58.5 MiB |
| 2016-07-21 20:05:02 | 6 |
|---|---|
| 2014-07-01 00:39:04 | 6 |
| 2017-05-15 17:52:53 | 6 |
| 2015-09-08 17:39:51 | 6 |
| 2016-06-01 17:07:51 | 6 |
| Other values (7205508) |
| Value | Count | Frequency (%) | |
| 2016-07-21 20:05:02 | 6 | < 0.1% | |
| 2014-07-01 00:39:04 | 6 | < 0.1% | |
| 2017-05-15 17:52:53 | 6 | < 0.1% | |
| 2015-09-08 17:39:51 | 6 | < 0.1% | |
| 2016-06-01 17:07:51 | 6 | < 0.1% | |
| 2014-08-05 17:01:52 | 5 | < 0.1% | |
| 2016-08-14 14:29:04 | 5 | < 0.1% | |
| 2016-07-26 18:26:49 | 5 | < 0.1% | |
| 2017-03-28 18:13:37 | 5 | < 0.1% | |
| 2016-07-21 19:14:48 | 5 | < 0.1% | |
| Other values (7205503) | 7669895 | > 99.9% |
Frequencies of value counts
Unique
| Unique | 6766286 ? |
|---|---|
| Unique (%) | 88.2% |
Histogram of lengths of the category
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
| Distinct | 7194377 |
|---|---|
| Distinct (%) | 93.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 58.5 MiB |
| 2017-04-02 16:05:32 | 6 |
|---|---|
| 2016-05-10 17:44:08 | 5 |
| 2016-06-05 15:46:28 | 5 |
| 2015-07-23 19:23:26 | 5 |
| 2014-08-13 18:57:28 | 5 |
| Other values (7194372) |
| Value | Count | Frequency (%) | |
| 2017-04-02 16:05:32 | 6 | < 0.1% | |
| 2016-05-10 17:44:08 | 5 | < 0.1% | |
| 2016-06-05 15:46:28 | 5 | < 0.1% | |
| 2015-07-23 19:23:26 | 5 | < 0.1% | |
| 2014-08-13 18:57:28 | 5 | < 0.1% | |
| 2015-04-23 17:48:13 | 5 | < 0.1% | |
| 2017-03-28 17:51:08 | 5 | < 0.1% | |
| 2016-05-27 17:50:22 | 5 | < 0.1% | |
| 2015-06-12 22:13:19 | 5 | < 0.1% | |
| 2016-09-19 17:35:27 | 5 | < 0.1% | |
| Other values (7194367) | 7669899 | > 99.9% |
Frequencies of value counts
Unique
| Unique | 6745259 ? |
|---|---|
| Unique (%) | 87.9% |
Histogram of lengths of the category
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
| Distinct | 208 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 58.5 MiB |
| Allende-Platz/Grindelhof | 157780 |
|---|---|
| Schulterblatt/Eifflerstraße | 145002 |
| Mundsburg / Schürbeker Straße | 112640 |
| Goldbekplatz / Semperstraße | 111603 |
| Lange Reihe / Lohmühlenpark | 108510 |
| Other values (203) |
| Value | Count | Frequency (%) | |
| Allende-Platz/Grindelhof | 157780 | 2.1% | |
| Schulterblatt/Eifflerstraße | 145002 | 1.9% | |
| Mundsburg / Schürbeker Straße | 112640 | 1.5% | |
| Goldbekplatz / Semperstraße | 111603 | 1.5% | |
| Lange Reihe / Lohmühlenpark | 108510 | 1.4% | |
| Jungfernstieg / Ballindamm | 108407 | 1.4% | |
| Jarrestraße / Rambatzweg | 104656 | 1.4% | |
| Neuer Pferdemarkt / Beim Grünen Jäger | 103806 | 1.4% | |
| Paulinenplatz/Wohlwillstraße | 101823 | 1.3% | |
| Eduard-Rhein-Ufer / Schwanenwik | 100546 | 1.3% | |
| Other values (198) | 6515177 | 84.9% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 48 |
|---|---|
| Median length | 28 |
| Mean length | 28.92616262 |
| Min length | 14 |
start_station_id
Real number (ℝ≥0)
| Distinct | 208 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 169794.3685 |
|---|---|
| Minimum | 131543 |
| Maximum | 268358 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 58.5 MiB |
Quantile statistics
| Minimum | 131543 |
|---|---|
| 5-th percentile | 131641 |
| Q1 | 131885 |
| median | 140796 |
| Q3 | 211706 |
| 95-th percentile | 244935 |
| Maximum | 268358 |
| Range | 136815 |
| Interquartile range (IQR) | 79821 |
Descriptive statistics
| Standard deviation | 41570.88457 |
|---|---|
| Coefficient of variation (CV) | 0.2448307616 |
| Kurtosis | -1.316271791 |
| Mean | 169794.3685 |
| Median Absolute Deviation (MAD) | 9154 |
| Skewness | 0.5420001566 |
| Sum | 1.302314317e+12 |
| Variance | 1728138444 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 198077 | 157780 | 2.1% | |
| 131648 | 145002 | 1.9% | |
| 140799 | 112640 | 1.5% | |
| 140796 | 111603 | 1.5% | |
| 138385 | 108510 | 1.4% | |
| 131879 | 108407 | 1.4% | |
| 138376 | 104656 | 1.4% | |
| 131890 | 103806 | 1.4% | |
| 131547 | 101823 | 1.3% | |
| 140800 | 100546 | 1.3% | |
| Other values (198) | 6515177 | 84.9% |
| Value | Count | Frequency (%) | |
| 131543 | 90273 | 1.2% | |
| 131546 | 52956 | 0.7% | |
| 131547 | 101823 | 1.3% | |
| 131639 | 72443 | 0.9% | |
| 131640 | 21095 | 0.3% |
| Value | Count | Frequency (%) | |
| 268358 | 81 | < 0.1% | |
| 264821 | 2964 | < 0.1% | |
| 264820 | 6686 | 0.1% | |
| 264330 | 1222 | < 0.1% | |
| 256467 | 5401 | 0.1% |
| Distinct | 208 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 58.5 MiB |
| Allende-Platz/Grindelhof | 161211 |
|---|---|
| Schulterblatt/Eifflerstraße | 145284 |
| Jungfernstieg / Ballindamm | 114148 |
| Goldbekplatz / Semperstraße | 113991 |
| Mundsburg / Schürbeker Straße | 113893 |
| Other values (203) |
| Value | Count | Frequency (%) | |
| Allende-Platz/Grindelhof | 161211 | 2.1% | |
| Schulterblatt/Eifflerstraße | 145284 | 1.9% | |
| Jungfernstieg / Ballindamm | 114148 | 1.5% | |
| Goldbekplatz / Semperstraße | 113991 | 1.5% | |
| Mundsburg / Schürbeker Straße | 113893 | 1.5% | |
| Lange Reihe / Lohmühlenpark | 109179 | 1.4% | |
| Jarrestraße / Rambatzweg | 108136 | 1.4% | |
| Neuer Pferdemarkt / Beim Grünen Jäger | 103347 | 1.3% | |
| Paulinenplatz/Wohlwillstraße | 103000 | 1.3% | |
| Eduard-Rhein-Ufer / Schwanenwik | 102102 | 1.3% | |
| Other values (198) | 6495659 | 84.7% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 48 |
|---|---|
| Median length | 28 |
| Mean length | 28.9019579 |
| Min length | 14 |
end_station_id
Real number (ℝ≥0)
| Distinct | 208 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 170111.856 |
|---|---|
| Minimum | 131543 |
| Maximum | 268358 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 58.5 MiB |
Quantile statistics
| Minimum | 131543 |
|---|---|
| 5-th percentile | 131641 |
| Q1 | 131887 |
| median | 140796 |
| Q3 | 211706 |
| 95-th percentile | 244935 |
| Maximum | 268358 |
| Range | 136815 |
| Interquartile range (IQR) | 79819 |
Descriptive statistics
| Standard deviation | 41577.36347 |
|---|---|
| Coefficient of variation (CV) | 0.244411909 |
| Kurtosis | -1.327636139 |
| Mean | 170111.856 |
| Median Absolute Deviation (MAD) | 9154 |
| Skewness | 0.5283857597 |
| Sum | 1.30474943e+12 |
| Variance | 1728677153 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 198077 | 161211 | 2.1% | |
| 131648 | 145284 | 1.9% | |
| 131879 | 114148 | 1.5% | |
| 140796 | 113991 | 1.5% | |
| 140799 | 113893 | 1.5% | |
| 138385 | 109179 | 1.4% | |
| 138376 | 108136 | 1.4% | |
| 131890 | 103347 | 1.3% | |
| 131547 | 103000 | 1.3% | |
| 140800 | 102102 | 1.3% | |
| Other values (198) | 6495659 | 84.7% |
| Value | Count | Frequency (%) | |
| 131543 | 96289 | 1.3% | |
| 131546 | 53211 | 0.7% | |
| 131547 | 103000 | 1.3% | |
| 131639 | 76155 | 1.0% | |
| 131640 | 20242 | 0.3% |
| Value | Count | Frequency (%) | |
| 268358 | 72 | < 0.1% | |
| 264821 | 2968 | < 0.1% | |
| 264820 | 6607 | 0.1% | |
| 264330 | 1087 | < 0.1% | |
| 256467 | 5497 | 0.1% |
| Distinct | 223 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 58.5 MiB |
| iPhone SRH | |
|---|---|
| Android SRH | |
| IVR | |
| iPhone CAB | 210597 |
| Android CAB | 82403 |
| Other values (218) |
| Value | Count | Frequency (%) | |
| iPhone SRH | 2012624 | 26.2% | |
| Android SRH | 1460250 | 19.0% | |
| IVR | 763349 | 10.0% | |
| iPhone CAB | 210597 | 2.7% | |
| Android CAB | 82403 | 1.1% | |
| Unknown | 77217 | 1.0% | |
| terminal HH_93 (-2215-) | 65395 | 0.9% | |
| Terminal HH_5 (-2132-) | 51094 | 0.7% | |
| Terminal HH_79 (-2323-) | 48210 | 0.6% | |
| Terminal HH_75 (-2364-) | 44666 | 0.6% | |
| Other values (213) | 2854145 | 37.2% |
Frequencies of value counts
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Histogram of lengths of the category
Length
| Max length | 28 |
|---|---|
| Median length | 11 |
| Mean length | 14.68398999 |
| Min length | 3 |
duration_in_min
Real number (ℝ≥0)
| Distinct | 33 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14.05734509 |
|---|---|
| Minimum | 1 |
| Maximum | 33 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 58.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 8 |
| median | 13 |
| Q3 | 19 |
| 95-th percentile | 28 |
| Maximum | 33 |
| Range | 32 |
| Interquartile range (IQR) | 11 |
Descriptive statistics
| Standard deviation | 7.37423814 |
|---|---|
| Coefficient of variation (CV) | 0.5245825645 |
| Kurtosis | -0.4793869522 |
| Mean | 14.05734509 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 0.6106385271 |
| Sum | 107819134 |
| Variance | 54.37938814 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=33)
| Value | Count | Frequency (%) | |
| 8 | 451009 | 5.9% | |
| 7 | 448232 | 5.8% | |
| 9 | 442258 | 5.8% | |
| 6 | 430248 | 5.6% | |
| 10 | 427810 | 5.6% | |
| 11 | 408080 | 5.3% | |
| 12 | 384119 | 5.0% | |
| 5 | 364379 | 4.8% | |
| 13 | 361730 | 4.7% | |
| 14 | 338326 | 4.4% | |
| Other values (23) | 3613759 | 47.1% |
| Value | Count | Frequency (%) | |
| 1 | 992 | < 0.1% | |
| 2 | 9676 | 0.1% | |
| 3 | 145243 | 1.9% | |
| 4 | 260936 | 3.4% | |
| 5 | 364379 | 4.8% |
| Value | Count | Frequency (%) | |
| 33 | 53896 | 0.7% | |
| 32 | 61895 | 0.8% | |
| 31 | 72807 | 0.9% | |
| 30 | 82244 | 1.1% | |
| 29 | 92964 | 1.2% |
| Distinct | 13808 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.879521843 |
|---|---|
| Minimum | 0 |
| Maximum | 19.19140257 |
| Zeros | 293637 |
| Zeros (%) | 3.8% |
| Memory size | 58.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.4124845333 |
| Q1 | 0.9761789274 |
| median | 1.598932269 |
| Q3 | 2.583083313 |
| 95-th percentile | 4.240455163 |
| Maximum | 19.19140257 |
| Range | 19.19140257 |
| Interquartile range (IQR) | 1.606904385 |
Descriptive statistics
| Standard deviation | 1.21018557 |
|---|---|
| Coefficient of variation (CV) | 0.6438794923 |
| Kurtosis | 0.7775827794 |
| Mean | 1.879521843 |
| Median Absolute Deviation (MAD) | 0.742906618 |
| Skewness | 0.933151183 |
| Sum | 14415838.56 |
| Variance | 1.464549114 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 293637 | 3.8% | |
| 0.6899266912 | 35324 | 0.5% | |
| 0.4670319582 | 28565 | 0.4% | |
| 0.7730821796 | 23199 | 0.3% | |
| 0.838103895 | 21517 | 0.3% | |
| 0.6455644214 | 21438 | 0.3% | |
| 1.428854315 | 20358 | 0.3% | |
| 0.5965872349 | 19770 | 0.3% | |
| 0.8582389897 | 19528 | 0.3% | |
| 1.02143741 | 19000 | 0.2% | |
| Other values (13798) | 7167614 | 93.5% |
| Value | Count | Frequency (%) | |
| 0 | 293637 | 3.8% | |
| 0.08803222592 | 296 | < 0.1% | |
| 0.1058156515 | 1043 | < 0.1% | |
| 0.1215029514 | 260 | < 0.1% | |
| 0.1298428182 | 341 | < 0.1% |
| Value | Count | Frequency (%) | |
| 19.19140257 | 1 | < 0.1% | |
| 18.40713798 | 1 | < 0.1% | |
| 14.9559601 | 1 | < 0.1% | |
| 14.78695103 | 1 | < 0.1% | |
| 13.78492366 | 1 | < 0.1% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| df_index | bike_id | user_id | date_from | date_until | start_station_name | start_station_id | end_station_name | end_station_id | booked_via | duration_in_min | distance_in_km | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 143517 | A821059B555C7764A2FF801180874A2FCB326222 | 2014-01-01 00:34:54 | 2014-01-01 00:50:14 | U-Bahn Baumwall | 214170 | Mönckebergstraße / Rosenstraße | 131880 | iPhone SRH | 16 | 1.293661 |
| 1 | 1 | 119830 | 1EBC930DB407ACEAE2FDE23A6CA40492EA3DFBB2 | 2014-01-01 01:39:55 | 2014-01-01 01:57:27 | Bahnhof Altona Ost/Max-Brauer-Allee | 131646 | Schulterblatt/Eifflerstraße | 131648 | Android SRH | 18 | 2.032271 |
| 2 | 2 | 143501 | 7AD2C1B70137479062A6DD73815835986677BB2D | 2014-01-01 01:40:20 | 2014-01-01 01:53:09 | Weidestraße/Biedermannplatz | 211922 | Jarrestraße / Rambatzweg | 138376 | Techniker HH_119 (-2334-) | 13 | 0.954178 |
| 3 | 4 | 108641 | 4F4F752203EA6FC872D576E9289C4E1B362E16F6 | 2014-01-01 02:05:55 | 2014-01-01 02:13:49 | Mundsburg / Schürbeker Straße | 140799 | Bartholomäusstraße/Beim Alten Schützenhof | 211923 | iPhone SRH | 8 | 0.693159 |
| 4 | 5 | 143829 | FEA7FF33A3252EE99E58B9E15724AA861CAB1DDF | 2014-01-01 02:29:03 | 2014-01-01 02:32:41 | Krausestraße/Eilbektal | 208295 | Lortzingstraße/Friedrichsberger Straße | 213833 | iPhone SRH | 4 | 0.645564 |
| 5 | 6 | 143552 | 60A788942F6A49BF54DB9013DB05428F897FCCCE | 2014-01-01 03:07:07 | 2014-01-01 03:20:08 | Winterhuder Weg/ Zimmerstraße | 208292 | Wiesendamm/Roggenkamp | 212607 | Android SRH | 14 | 1.977492 |
| 6 | 7 | 120327 | E32FF481BF244603D691DED875AC4FBEDCF96BFB | 2014-01-01 03:12:50 | 2014-01-01 03:14:54 | Bahnhof Altona Ost/Max-Brauer-Allee | 131646 | Bahnhof Altona Ost/Max-Brauer-Allee | 131646 | Terminal HH_55 (-2121-) | 3 | 0.000000 |
| 7 | 9 | 143577 | 708275C3A732D3BD47E97F1E0AC3AE01735FA170 | 2014-01-01 04:27:51 | 2014-01-01 04:45:18 | Hofweg/Am Langenzug | 200502 | Eppendorfer Weg/Hoheluftchaussee | 198086 | Android SRH | 18 | 2.830757 |
| 8 | 10 | 143580 | 4FCAC2DAFF984CC2FFC85D0B87D577D266010745 | 2014-01-01 04:58:33 | 2014-01-01 05:12:31 | Löwenstraße/Eppendorfer Weg | 213680 | Heußweg/Wiesenstraße | 201326 | Techniker HH_138 (-2244-) | 14 | 1.775725 |
| 9 | 12 | 119948 | 092D25BAD64832AE3F69488573BA5C398C25B51D | 2014-01-01 01:08:18 | 2014-01-01 01:13:02 | Isestraße / Hoheluftbrücke | 140804 | Eppendorfer Weg/Hoheluftchaussee | 198086 | IVR | 5 | 0.555314 |
Last rows
| df_index | bike_id | user_id | date_from | date_until | start_station_name | start_station_id | end_station_name | end_station_id | booked_via | duration_in_min | distance_in_km | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 7669940 | 16228280 | 108747 | B4E335F11DC342724FD0C5208AF2B0AC89D2B581 | 2017-05-15 22:36:58 | 2017-05-15 22:46:49 | Alsenstraße/Düppelstraße | 211706 | Osterstraße/Bismarckstraße | 131642 | Terminal HH_95 (-2134-) | 10 | 1.463426 |
| 7669941 | 16228281 | 109223 | E83B9222C38C025523BE16AB2F00530522EF93CC | 2017-05-15 22:38:44 | 2017-05-15 22:44:26 | Wiesendamm/Roggenkamp | 212607 | Schleidenstraße/Osterbekstraße | 208307 | iPhone SRH | 6 | 0.670447 |
| 7669942 | 16228282 | 116034 | 6F18F52C068612DC5AE3530A451FC80222F7B4C9 | 2017-05-15 22:45:13 | 2017-05-15 22:59:26 | Eimsbütteler Straße/Waterloostraße | 131644 | Lappenbergsallee / Bei der Apostelkirche | 243618 | Android CAB | 15 | 1.198705 |
| 7669943 | 16228283 | 117564 | 4C3C86C70B705E7075399845BFDF571548A50B90 | 2017-05-15 22:45:47 | 2017-05-15 23:00:21 | Neumühlen/Övelgönne | 213856 | Bahnhof Altona West / Busbahnhof | 131889 | Techniker HH_135 (-2151-) | 15 | 1.518474 |
| 7669944 | 16228288 | 120311 | C639C55CFF8334C7C7983E278A9B257B044F00F1 | 2017-05-15 23:44:43 | 2017-05-16 00:13:44 | Goldbekplatz / Semperstraße | 140796 | Berliner Tor / Berlinertordamm | 131652 | Terminal HH_73 (-2363-) | 30 | 3.529680 |
| 7669945 | 16228289 | 143621 | FF0963FE7D54E9455B5CF1ADE5DFEF484F8C525F | 2017-05-16 01:11:33 | 2017-05-16 01:32:16 | Schulterblatt/Eifflerstraße | 131648 | Fischersallee/Bleickenallee | 211711 | Unknown | 21 | 2.880246 |
| 7669946 | 16228290 | 109115 | FF7147E7A3583564085352944933642F67C4D755 | 2017-05-16 03:25:09 | 2017-05-16 03:31:05 | Königstraße / Struenseestraße | 131650 | Große Rainstraße/Ottenser Hauptstraße | 244943 | iPhone SRH | 6 | 0.989741 |
| 7669947 | 16228291 | 116255 | 5BB54A7EBCD7A5A88FD410A537E10160BA120BB2 | 2017-05-16 07:15:40 | 2017-05-16 07:19:49 | Heußweg/Wiesenstraße | 201326 | Lappenbergsallee / Bei der Apostelkirche | 243618 | Terminal HH_11 (-2225-) | 5 | 0.620216 |
| 7669948 | 16228293 | 119663 | 1024F6970D5BE146588D64F6AF427E147ADC642E | 2017-05-16 07:36:36 | 2017-05-16 07:44:16 | Bahnhof Altona Ost/Max-Brauer-Allee | 131646 | Neuer Pferdemarkt / Beim Grünen Jäger | 131890 | iPhone SRH | 8 | 1.990734 |
| 7669949 | 16228295 | 120488 | CC6405146B51242A9169AB55E88A5C472EA1B2AA | 2017-05-16 07:40:17 | 2017-05-16 07:50:07 | Weidestraße/Biedermannplatz | 211922 | Mundsburg / Schürbeker Straße | 140799 | Techniker HH_119 (-2334-) | 10 | 1.241150 |